Scientific Data — Latest Matching Preprints

1

Development of an Open-Access Action Observation Video Library for Upper Limb Motor Rehabilitation

Madison, M.; Wheaton, L. A.; Rowe, V.

2026-06-10 rehabilitation medicine and physical therapy 10.64898/2026.06.10.26355108 medRxiv

Top 0.9%

1.9%

Background: Occupational therapists can improve stroke survivors' hand and arm movement and participation in daily activities through action observation (AO). AO involves watching another person's hand or arm complete a movement or task. While research generally supports the use of AO with stroke survivors, few AO videos are available to occupational therapists, which makes applying AO challenging. Objective: The purpose of this work is to develop a structured and widely accessible tool to support access to AO for stroke survivors, occupational therapists, and researchers. Methods: To develop an AO video library for stroke rehabilitation, functional and non-functional upper limb task deficits were first identified through clinical observations and clinician interviews to establish a prioritized list of daily activities. In collaboration with media production specialists, healthy adult volunteers were recruited and filmed performing these tasks from both first- and third-person perspectives. The recorded videos were then systematically edited, enhanced with instructional title slides, and distributed via a public YouTube channel for clinical application and a categorized digital repository for research purposes. Results: Initial assessments revealed a complete lack of familiarity, awareness, and utilization of AO resources among local occupational therapists, despite high perceived clinical utility. To address this gap, a final library of 150 tasks was established, resulting in the production of 419 finalized, standardized videos featuring six healthy volunteers. For clinical application, these videos were hosted on a free, public YouTube channel organized into 18 functional playlists, while a parallel set was structured into distinct movement categories for research repository storage. Conclusion: By providing a structured and highly accessible tool, this repository enables clinicians, researchers, and caregivers to readily implement evidence-based action observation interventions in both clinical and home settings.

2

Combining centralized and decentralized approaches to assess and ensure data quality in Eurocrine(R) via Microsoft Power BI and DataquieR

Musholt, T. J.; Clerici, T.; Bergenfelz, A.; Schmidt, C. O.; Struckmann, S.

2026-06-05 health informatics 10.64898/2026.06.04.26354884 medRxiv

Top 1%

1.6%

Background: Medical registries have gained importance in the evaluation of healthcare quality outcomes. In the absence of high-quality evidence, such as randomized controlled trials, studies based on registry data are essential for informing clinical guidelines. Methods for assessing data quality are rarely described in detail. To ensure the credibility of registry-based studies, registries must use all available technical and operational means to guarantee high data quality. Method: Eurocrine(R) is a pan-European endocrine surgical database and quality registry initially funded by the EU healthcare programme, which started in 2015 and included more than 200,000 interventions as of April 2025. To ensure high data quality, interactive and standardized reports are created both centrally and locally via Microsoft Power BI. In addition, comprehensive data quality analyses were performed via the R-based package dataquieR. Results: Although a multitude of technical measures (for example, input screen design and real-time plausibility checks during data entry) are in place, they are not sufficient to prevent human errors at data entry. Errors identified in the reports were corrected, and preventive measures were implemented. Overall, the data quality was assessed as very good in terms of completeness, accuracy, and consistency. Conclusion: It is very important to provide registry users with an efficient and smart tool to identify data issues, as they have the clinical information to correct them. Data quality reports generated with dataquieR represent an effective tool for registry administrators. Predesigned Microsoft Power BI reports enable participating Eurocrine(R) clinics to self-audit their data.

3

Multi-region sampling of the human small intestine using an ingestible device

Fu, B.; DeSchepper, L. B.; Sun, J.; McKeithen-Mead, S. A.; Kapili, B.; Ochoa-Andersen, P.; Spencer, S. P.; Fardeen, T.; Ricardo, M.; El Kamari, V.; Sinha, S.; Relman, D. A.; Grembi, J. A.; Shalon, D.; Estrela, S.; Huang, K. C.

2026-06-10 gastroenterology 10.64898/2026.06.09.26353912 medRxiv

Top 1%

1.4%

The human small intestine (SI) plays a central role in nutrient processing, host-microbe interactions, and immune regulation, yet remains poorly characterized due to the lack of minimally disruptive sampling methods. Here, we present a protocol for deploying, recovering, and analyzing samples collected using an ingestible device that enables multi-region, lumen-targeted SI sampling during normal digestion. The device incorporates a ~30-cm collapsible tube wound into pH- or time-responsive layers that sequentially unfurl in situ, typically capturing three spatially ordered samples with high yield and reliable retrieval. This protocol outlines study design, participant handling, device recovery, contamination control, and standardized workflows for analyses, including cell quantification, culturomics, sequencing, and metabolomics. We further describe benchmarking approaches for evaluating spatial resolution and strategies for assay prioritization when sample volume is limiting. By reducing participant burden and facilitating integration with stool, saliva, and clinical metadata, this approach enables longitudinal and large-cohort studies linking SI microbial ecology and host physiology to human health.

4

BodyMAE: A Surface-Area Aware Masked Autoencoder for Body Composition Estimation from 3D Body Scans

Zheng, Y.; Feng, B.; Cheng, R.; Qiu, C.; Long, Z.; Vaziri, K.; Hahn, J.

2026-06-06 health informatics 10.64898/2026.06.04.26354925 medRxiv

Top 2%

1.2%

Accurate assessment of body composition is important to risk stratification and management of metabolic, musculoskeletal, and aging-related diseases, yet reference modalities such as dual-energy X-ray absorptiometry (DXA) are costly and impractical for frequent monitoring. Commodity 3D body scans offer a low-cost, radiation-free alternative, but extracting meaningful and predictive shape features from scans remains challenging due to nonuniform point density, variable body size, and cross-device differences. We introduce BodyMAE, a self-supervised, surface-area aware masked autoencoder for metric-scale 3D body scans. The pipeline integrates area-adjusted sampling, a long-range focused encoder, and a lightweight decoder regularized to promote locally uniform reconstructions. Trained and evaluated on 917 3D body scans paired with clinical DXA reports, BodyMAE achieves strong accuracy on fat percentage (root-mean-square error (RMSE) 3.825 percentage points, R^2 0.908), fat mass (RMSE 3.694 kg, R^2 0.968), and lean mass (RMSE 3.608 kg, R^2 0.901), with competitive performance on bone mineral content (RMSE 0.284 kg, R^2 0.754). We also assess feature stability across pretrained baselines, finding higher retrieval accuracy for our representations (Top-1 90.131%). These results indicate that combining metric-aware sampling, long-range relational encoding, and local geometric regularization enables accurate body composition estimation from 3D body scans, as validated by comparisons to DXA-derived measurements.

5

The impact of B1+ inhomogeneity on image quality metrics and morphometric statistical inferences at 7 T MRI

Liu, K.; Uludag, K.; de Coo, I. F. M.; Smeets, H. J. M.; Jansen, J. F. A.; Formisano, E.; Poser, B. A.; Haast, R. A. M.; Ivanov, D.

2026-06-09 radiology and imaging 10.64898/2026.06.08.26355136 medRxiv

Top 2%

0.9%

Introduction: Structural neuroimaging relies on T1-weighted (T1w) magnetic resonance imaging (MRI) for brain morphometry, yet at 7 Tesla (7 T) transmit field (B1+) inhomogeneity remains a major source of bias. Although Magnetization Prepared 2 Rapid Acquisition Gradient Echoes (MP2RAGE) improves the tissue contrast, residual B1+ effects may persist and may be exacerbated in aging or clinical populations, where anatomical and physiological factors further challenge image quality and preprocessing. The impact of B1+ inhomogeneity on automated quality assessment and morphometric statistical inference remains insufficiently understood. Methods: Submillimeter 7 T MP2RAGE brain acquisitions from carriers of a mitochondrial gene mutation (m.3243A>G) and controls were retrieved from previous studies. Image quality before and after B1+ inhomogeneity correction was assessed by multiple automated pipelines. Case-control morphometric measures, including regional volume and mean cortical thickness, were analyzed in both registration-based and deep-learning-based segmentation frameworks. Changes in image quality metrics (IQMs) and morphometric statistical significance were evaluated to determine the impact of B1+ inhomogeneity correction. Results: Overall image quality rating and metrics sensitive to intensity non-uniformity and topological integrity consistently improved after B1+ inhomogeneity correction. However, its impact on morphometric statistical inferences was strongly method-dependent. Some pipelines showed redistribution of significant regions, whereas others predominantly showed increased sensitivity. Across methods, B1+ inhomogeneity correction altered the findings of morphometric analyses, particularly in cortical regions. Conclusion: Residual B1+ inhomogeneity at 7 T substantially influences both image quality control and morphometric evaluations. Current automated quality control approaches can hardly capture these effects reliably. B1+ inhomogeneity correction not only improves intensity uniformity but also changes the sensitivity of morphometric statistical inferences. To establish reliable morphometric biomarkers at ultra-high-field (UHF) strengths, explicit B1+ correction and customized preprocessing are practically necessary and highly recommended.

6

Next-Generation Skin Cancer Detection Using Efficient Fuzzy Fusion of Genomic and Imaging Data

Molla, A. R.; Maity, A.; Saha, S.; Bhattacharya, R.; Chakraborty, A.; Biswas, S.; Nath, S.

2026-06-08 health informatics 10.64898/2026.06.05.26355024 medRxiv

Top 2%

0.8%

Skin cancer requires early detection for improved survival rates. Most existing methods rely on deep-learning-based image classification, which is affected by visual similarity among lesions. Fewer studies use Gene Expression (GE) analysis, which captures molecular characteristics but lacks structural and visual details. To overcome the limitations of individual modalities, this paper proposes a multimodal framework integrating dermoscopic images and GE profiles for skin cancer classification. EfficientNet and logistic regression are used for image-based analysis and genomic skin lesion profiling, respectively, followed by fuzzy rule-based decision systems to reduce uncertainty within individual modalities. Finally, fuzzy fusion combines predictions from both modalities using uncertainty-based weighting of classifier outputs. The experimental findings show that the image-based and GE-based classification models each achieved accuracies of nearly 92%. Integrating the prediction results through the proposed fuzzy fusion strategy further enhanced classification performance, achieving an overall accuracy of 94.25%. The results obtained outperform contemporary methods, highlighting the effectiveness of combining complementary multimodal information compared with single-modality approaches.
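
The abstract does not specify how the uncertainty-based weighting is computed. As a minimal sketch of the general idea, the Python below weights each modality by one minus its normalized Shannon entropy, so the more confident classifier dominates the fused prediction. The entropy-based weight and the names `certainty` and `fuse` are assumptions for illustration, not the authors' fuzzy fusion rule.

```python
import math


def certainty(probs):
    """Return 1 minus the normalized Shannon entropy (assumed uncertainty measure)."""
    entropy = -sum(p * math.log(p) for p in probs if p > 0)
    return 1.0 - entropy / math.log(len(probs))


def fuse(image_probs, gene_probs):
    """Combine two class-probability vectors with certainty-proportional weights."""
    w_img, w_gene = certainty(image_probs), certainty(gene_probs)
    total = w_img + w_gene
    if total == 0:
        # Both classifiers are maximally uncertain: fall back to equal weights.
        w_img = w_gene = 0.5
    else:
        w_img, w_gene = w_img / total, w_gene / total
    return [w_img * a + w_gene * b for a, b in zip(image_probs, gene_probs)]


# Hypothetical outputs for two classes (benign, malignant).
fused = fuse([0.55, 0.45], [0.10, 0.90])
print(fused)
```

In this hypothetical call the image model is nearly undecided, so the fused vector stays close to the gene-expression prediction.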

7

Shifting patterns of importation risk of Bundibugyo Ebola virus disease to Europe under outbreak expansion scenarios

Fanelli, F.; Parino, F.; Poletto, C.; Colizza, V.

2026-06-04 public and global health 10.64898/2026.05.31.26354511 medRxiv

Top 2%

0.7%

The 2026 Bundibugyo Ebola outbreak in eastern Democratic Republic of the Congo (DRC) has already generated international spread to Uganda, raising concerns about further regional and international dissemination. Using International Air Transport Association origin-destination passenger flows, we assessed relative exposure to Ebola virus disease importation into Europe under six outbreak expansion scenarios reflecting plausible pathways of geographical spread, including cross-border transmission and amplification in highly connected regional capitals. Relative exposure patterns remained largely unchanged under localized transmission in eastern DRC and border-spillover scenarios. Expansion into South Sudan generated a first structural increase in importation pressure to Europe through the connectivity associated with Juba, while hypothetical amplification in Kampala, Kigali, and Kinshasa substantially increased importation pressure and reshaped exposure patterns across Europe. Across all scenarios, France, Italy, and the United Kingdom remained among the most exposed countries. Mobility-informed scenario analyses support preparedness as the geography of the outbreak evolves.

8

A risk-of-contagion index using a Bayesian based model for the COVID-19 epidemic in Mexico

Corona-Moreno, R.; Acuna-Zegarra, M. A.; Santana-Cibrian, M.; Velasco-Hernandez, J. X.

2026-06-10 health policy 10.64898/2026.06.09.26355274 medRxiv

Top 3%

0.7%

During the COVID-19 pandemic, limited testing capacity and reporting delays complicated epidemic surveillance and decision-making in Mexico. We calibrated covidestim, a Bayesian nowcasting model, to estimate total SARS-CoV-2 infections from reported cases and deaths using Mexican surveillance data. Disease-progression distribution priors were calibrated using Mexico City records and validated through comparisons with national seroprevalence surveys, hospitalization data, and annual reported severe-case rates across all states. Using the reconstructed estimates of active infections, we implemented an event-based risk framework that quantifies the probability of encountering at least one infectious individual in gatherings of different sizes. This probability was subsequently translated into a four-level epidemiological traffic-light indicator and computed at both state and municipality levels. The resulting estimates revealed substantial spatial heterogeneity that is obscured by state-level aggregation, particularly in states with marked differences between urban and rural municipalities. To evaluate consistency with public-health indicators, we compared the proposed risk classification with the official Mexican epidemiological traffic-light system, considering interpretable gathering sizes relevant to public-health decision making. Weekly reports derived from this framework were delivered to policymakers in the State of Queretaro, Mexico, as an anticipation tool for school reopening and public-space management. These results demonstrate that Bayesian reconstruction of infections combined with event-based risk metrics can provide an interpretable and generalizable municipality-level complement to routine surveillance systems, particularly in regions with limited testing capacity and heterogeneous local transmission dynamics.
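
The event-based risk described above can be illustrated with the standard calculation for the chance that at least one of n attendees is infectious, 1 - (1 - p)^n, where p is the prevalence of active infections. The sketch below assumes attendees are drawn independently from the local population. The threshold values and the mapping to four colors are hypothetical placeholders, not the cut-offs used in the preprint.

```python
def gathering_risk(active_infections, population, group_size):
    """Probability that at least one attendee is infectious (independent mixing assumed)."""
    if population <= 0:
        raise ValueError("population must be positive")
    prevalence = min(max(active_infections / population, 0.0), 1.0)
    return 1.0 - (1.0 - prevalence) ** group_size


def traffic_light(risk, thresholds=(0.25, 0.50, 0.75)):
    """Map a risk probability to one of four levels; thresholds are hypothetical."""
    labels = ("green", "yellow", "orange", "red")
    level = sum(risk >= t for t in thresholds)
    return labels[level]


# Hypothetical municipality: 1,000 active infections among 100,000 residents.
for size in (10, 50, 200):
    risk = gathering_risk(1_000, 100_000, size)
    print(size, round(risk, 3), traffic_light(risk))
```

Because the risk grows with group size at a fixed prevalence, the same municipality can fall into different levels depending on the gathering size considered.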

9

PhysiCase: Development and dual-layer validation of synthetic cases for health professional education: A pilot study leveraging Generative AI

Komolafe, O. O.; Roberts, A. C.; Shelley, J.; Tawiah, A. K.

2026-06-09 rehabilitation medicine and physical therapy 10.64898/2026.06.07.26355114 medRxiv

Top 3%

0.7%

High-quality, domain-specific datasets are foundational to advancing educational tools and AI systems in healthcare, yet assembling case repositories from real-world clinical records faces substantial privacy, ethical, and licensing barriers. Synthetic data generation offers a compelling pathway forward, but educational cases require rigorous validation to ensure clinical plausibility and pedagogical utility. This pilot study introduces PhysiCase, a dual-layer validation pipeline for synthetic case generation, and evaluates the feasibility of combining automated LLM-based screening with expert educator review. We generated 128 synthetic musculoskeletal (MSK) cases using four frontier large language models (GPT-4.1, GPT-4o, Google Gemini 2.5 Pro, and Llama 4 Scout) across 28 clinical conditions. Cases underwent automated quality screening using an "LLM-as-judge" framework (DeepEval) assessing prompt alignment, JSON correctness, answer relevance, bias, toxicity, and completeness. Ninety cases (70.3%) passed automated filtering and proceeded to expert evaluation by four MSK physiotherapy educators, who rated medical accuracy, realism, fidelity, relevance, and usability on 5-point Likert scales. GPT-4.1 demonstrated the highest automated pass rate (96%) and strongest expert ratings (medical accuracy 4.10/5, usability 4.38/5), while Llama 4 Scout showed the lowest pass rate (33.3%) and expert ratings. Expert-evaluated cases achieved strong content validity indices for usability (97.5%), relevance (97.5%), and realism (95%), though medical accuracy showed greater variance (CVI 87.5%). Cross-layer correlation analysis revealed that automated completeness metrics moderately aligned with expert usability ratings, while answer relevance and prompt alignment showed weak or negative correlations with clinical correctness. Qualitative analysis identified three primary failure modes: reductive logic, biomechanical inconsistency, and administrative/contextual gaps. The dual-layer validation framework proved methodologically viable: automated screening efficiently reduced expert review burden, while human judgment remained indispensable for detecting subtle clinical reasoning failures. LLM-generated synthetic cases have the potential to meet practical educational needs for MSK physiotherapy, but expert validation is essential to safeguard clinical accuracy. These findings support a scalable division of labour for synthetic case development, with targeted improvements to prompting and automated reasoning checks needed to address identified "nuance gaps." The code for this paper is available at https://github.com/kwid-ai/PhysiCase

10

Study Design Indexing in Transition: A Focused Comparison of manual NLM Indexing vs. Transformer-based Automated Models

Das, P.; Schneider, J.; Mayo-Wilson, E.; Kilicoglu, H.; Menke, J. D.; Nam, D.; Ninan, K.; Oberste, J.-P.; Troy, A. M.; Ying, X.; Holt, A. W.; Smalheiser, N. R.

2026-06-04 health informatics 10.64898/2026.06.03.26354854 medRxiv

Top 3%

0.7%

Objectives: Study design indexing of biomedical publications is crucial for evidence retrieval and synthesis. We sought to evaluate the accuracy and suitability of a transformer-based model (TM) for indexing clinical study designs, in comparison to National Library of Medicine (NLM) indexing. However, this is challenging for at least three reasons: First, to date, all automated systems have been trained and evaluated on manual NLM indexing assignments, which are themselves subject to errors. Second, TM's probabilistic predictive scores take uncertainty into account and can be converted to TRUE/FALSE assignments in different ways depending on the needs of users, while NLM labels are categorical. Third, our goal (to tag only articles that exhibit a given design) differs from that of NLM, which tags articles that discuss as well as exhibit that design. Materials and Methods: Therefore, we carried out a limited evaluation of TM that focuses only on the articles that received the most confident predictions, that is, the highest scores that are almost certainly TRUE and the lowest scores that are almost certainly FALSE, but which disagreed with NLM assignments. This was performed both for articles published in 2016 (when NLM decisions were manual) and in 2025 (when NLM decisions were automated). To establish ground truth, dual annotators indexed the articles independently, following written definitions, for four prominent study designs: cohort, case-control, cross-sectional, and case report. Results: For three designs (case-control, case report, cross-sectional), the articles having the top 100 predictive TM scores (when NLM failed to assign that design) were judged to exhibit that design in the great majority (86-100%) of cases. Conversely, the articles having the lowest 100 predictive TM scores (when NLM did assign the study design) exhibited the design in relatively few (0-21%) cases. The most confident predictions of TM were highly accurate and not redundant with automated NLM indexing; the exception was cohort study articles, in which both TM and NLM labels showed high error rates of both omission and commission. Discussion and Conclusion: TM may have value for identifying articles exhibiting study designs, which is especially important for clinical decision-making as well as systematic reviews and other evidence syntheses. NLM indexing of cohort studies cannot be regarded as a reliable gold standard for training or evaluation of automated systems, warranting efforts to create a new manually annotated corpus.

11

Beyond Injection Detection: A Positive-Security Prompt Firewall that Closes the Scope and PHI Gap SOTA Classifiers Miss in Healthcare

Schwoebel, J.; Semenec, I.; Rousseva, J.; Frasch, M. G.; Thorstenson, R.; Bhatt, M.

2026-06-06 health systems and quality improvement 10.64898/2026.06.04.26354950 medRxiv

Top 3%

0.5%

Large language models embedded in autonomous agents process trusted instructions and untrusted data in one context window, leaving them open to direct and indirect prompt injection. In healthcare this is not hypothetical: a 2025 JAMA Network Open study found commercial medical LLMs followed injected instructions in 94.4% of simulated patient encounters, including life-threatening recommendations. Yet the clinically decisive problem we quantify here is different. Most real clinical threats (protected health information (PHI) exfiltration, cross-patient access, bulk export, out-of-scope advice) are fluent, legitimate-looking requests that carry no attack signal, so even a state-of-the-art injection detector passes them. Existing runtime guardrails trade safety against latency: model-based auditors are accurate but add hundreds of milliseconds of Python inference, while lexical filters are fast but blind to obfuscated or semantically disguised payloads. We present QFIRE, an inline, provider-agnostic prompt firewall implemented as a single self-contained Rust toolchain: proxy, CLI, and benchmark harness. QFIRE combines three mechanisms: (i) positive-security scope constraints, which restrict a model call to a declared natural-language purpose and block out-of-scope drift even when no overt attack token is present; (ii) an asynchronous detector graph that runs N rules and their detector nodes concurrently, cheapest checks first; and (iii) a de-obfuscation pass that decodes Base64, hex, and ROT13, folds homoglyphs and leetspeak, and strips zero-width characters before detection. QFIRE ships 106 versioned firewall rules and a dedicated HIPAA Safe Harbor 18-identifier PHI panel, and runs a local DeBERTa v3 injection classifier via embedded ONNX Runtime. On 1968 public prompt-injection and jailbreak prompts, QFIRE's deterministic hybrid attains F1 0.86, statistically tied with Meta's state-of-the-art PromptGuard 2 (0.86) and above protectai DeBERTa v3 (0.83); lexical baselines lag at 0.16 to 0.50. Our central result is on QFIRE HealthBench, a new 2000-prompt healthcare benchmark we build and release with real garak and Microsoft PyRIT payloads. There, the same PromptGuard 2 recovers only 0.40 recall (DeBERTa v3: 0.57), because most clinical threats carry no injection signal; QFIRE's combined scope-plus-PHI chain reaches 0.83 recall (F1 0.87) at a calibrated 0.08 false-positive rate. Generic injection detection, even state of the art, is therefore necessary but not sufficient for healthcare agents. A bare LLM judge also closes most of this static-corpus gap (F1 0.90); QFIRE's contribution beyond static accuracy is auditable determinism, bounded latency, and adaptive robustness, where the bare judge falls to 34 to 59% recall (Section 5.5). End to end, placing QFIRE in front of a tool-using agent over a mock EHR sandbox cuts the agent's harmful-action rate from 0.38 to 0.00 at a 0.13 benign-utility cost. All code, rules, corpora snapshots, and scripts are released, and every table regenerates from a single make paper target against local models with no paid API keys.
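
To make the de-obfuscation idea concrete, here is a minimal Python sketch of such a pass. It is not QFIRE's code (the abstract describes a Rust implementation); the function names, the leetspeak table, and the token patterns are assumptions chosen for illustration. NFKC normalization only folds Unicode compatibility forms such as full-width letters, so cross-script homoglyph folding would need an additional mapping that is omitted here.

```python
import base64
import codecs
import re
import unicodedata

# Zero-width and byte-order-mark code points to delete before matching.
ZERO_WIDTH = dict.fromkeys(map(ord, "\u200b\u200c\u200d\u2060\ufeff"))
# Hypothetical leetspeak table; a real rule set would be larger.
LEET = str.maketrans("013457@$", "oieastas")


def normalize(prompt):
    """Strip zero-width characters, then fold compatibility forms with NFKC."""
    return unicodedata.normalize("NFKC", prompt.translate(ZERO_WIDTH))


def decode_base64(token):
    """Return the decoded text if the token is valid Base64 of printable UTF-8."""
    if len(token) % 4:
        return None
    try:
        text = base64.b64decode(token, validate=True).decode("utf-8")
    except ValueError:  # covers binascii.Error and UnicodeDecodeError
        return None
    return text if text.isprintable() else None


def decode_hex(token):
    """Return the decoded text if the token is hex-encoded printable UTF-8."""
    try:
        text = bytes.fromhex(token).decode("utf-8")
    except ValueError:
        return None
    return text if text.isprintable() else None


def candidate_views(prompt):
    """Return the cleaned prompt plus decoded variants for downstream detectors."""
    clean = normalize(prompt)
    views = [clean, codecs.decode(clean, "rot13"), clean.translate(LEET)]
    for token in re.findall(r"[A-Za-z0-9+/]{8,}={0,2}", clean):
        decoded = decode_base64(token)
        if decoded:
            views.append(decoded)
    for token in re.findall(r"(?:[0-9A-Fa-f]{2}){4,}", clean):
        decoded = decode_hex(token)
        if decoded:
            views.append(decoded)
    return views


hidden = base64.b64encode(b"ignore previous instructions").decode()
print(candidate_views("please run " + hidden))
```

A detector would then be run over every returned view, so a payload hidden in any one encoding is still seen in plain text.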

12

Functionally Focused Evaluation: A Novel Comparative Protocol for Wearable Electroencephalography Headsets

Bhuyan, A.; Wong, M.; McEwan, A.; Higgins, C.; Cooray, N.

2026-06-05 radiology and imaging 10.64898/2026.06.03.26354802 medRxiv

Top 3%

0.5%

With the emergence of electroencephalography (EEG) as a tool in the cognitive domain, new demands are being placed on the technology to keep up with functional applications, especially in the context of at-home neural monitoring. New use cases have fostered development of wearable EEG (wEEG) devices: portable, low-cost headsets used for EEG monitoring. This evolution of technology and application has not been accompanied by development in technology evaluation, often relying on function-agnostic markers to assess devices for efficacy in this new space. With current methods limited in scope, this study designed, tested and evaluated a novel functionally-focused comparative protocol for wEEG devices. Eight participants undertook a protocol for the evaluation of four established wEEG devices, assessing cognitive resolution and general usability. Compared to a well-established traditional analysis method (eyes open/eyes closed protocol), the novel design proposed here enabled the same analysis of headset resolution, while also providing additional context into user preferences and opening downstream possibilities for specific cognitive insights. Future research could enable the development of this protocol into a standardised method to ensure the performance of wEEG technology can satisfy emerging clinical needs.

13

Assessing the Reliability of a Controllable Sound Source Driven Bowel Sound Monitoring Device in Physiological Tissue Acoustic Environments

Zhao, J.; Zhao, Z.; Huang, X.; Li, Y.; Wu, J.; Peng, S.; Wang, S.; Sun, G.; Luan, Z.

2026-06-04 gastroenterology 10.64898/2026.06.03.26354788 medRxiv

Top 3%

0.5%

Objective: To verify the reliability of a self-developed bowel sound monitoring device under real biological tissue acoustic propagation conditions using a controllable sound source, and to establish quantitative evidence for its translational applicability. Methods: Freshly euthanized six-month-old Bama miniature pigs were used as an experimental model. A high-fidelity Bluetooth audio playback device was implanted into the abdominal cavity to deliver manually annotated bowel sound recordings as controllable acoustic stimuli. A self-developed bowel sound monitoring device was fixed on the abdominal surface for continuous signal acquisition. Playback timestamps were defined as the ground truth, and event-level matching was performed within a predefined temporal tolerance window. Four performance indicators were evaluated: (1) bowel sound acquisition and energy amplification, (2) event matching accuracy, (3) acoustic feature consistency, and (4) subjective agreement assessed by blinded auscultation from gastroenterologists with different levels of clinical experience. Results: The monitoring device exhibited stable detection capability and effectively covered the full spectral range of the original signals. It significantly enhanced bowel sound energy while preserving temporal and spectral characteristics, demonstrating high consistency in time- and frequency-domain features. Blinded clinician assessments showed a subjective agreement rate of 88.9% between original and surface-recorded bowel sound events. Conclusions: Under real tissue acoustic propagation conditions, the self-developed bowel sound monitoring device reliably captures bowel sound events with high temporal accuracy, acoustic fidelity, and clinical perceptual consistency. This validation, based on a controllable sound source, provides robust technical evidence for subsequent in vivo studies and clinical translation, supporting the development of objective and continuous gastrointestinal function monitoring.
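
Event-level matching against playback timestamps can be sketched as a one-to-one pairing of sorted event times within a tolerance window. The Python below is an illustrative assumption about how such matching might be done; the function name `match_events`, the 0.5 s tolerance, and the greedy two-pointer strategy are not taken from the preprint.

```python
def match_events(truth, detected, tolerance=0.5):
    """Pair ground-truth and detected event times (seconds) one-to-one.

    A detection counts as a hit when it lies within `tolerance` seconds of an
    unmatched ground-truth event. Returns counts plus precision and recall.
    """
    truth, detected = sorted(truth), sorted(detected)
    i = j = hits = 0
    while i < len(truth) and j < len(detected):
        delta = detected[j] - truth[i]
        if abs(delta) <= tolerance:
            hits += 1  # matched pair: consume both events
            i += 1
            j += 1
        elif delta < 0:
            j += 1  # detection precedes every remaining truth event: false positive
        else:
            i += 1  # truth event has no detection close enough: miss
    misses = len(truth) - hits
    false_alarms = len(detected) - hits
    precision = hits / len(detected) if detected else 0.0
    recall = hits / len(truth) if truth else 0.0
    return {"hits": hits, "misses": misses, "false_alarms": false_alarms,
            "precision": precision, "recall": recall}


# Hypothetical playback timestamps versus device detections.
print(match_events([1.0, 3.0, 5.0], [1.1, 2.0, 5.4, 7.0]))
```

Consuming both events on a hit keeps the matching one-to-one, so a single detection cannot be credited to two playback events.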

14

Singular Value Decomposition-Based Coil Combination Improves the Accuracy and Noise-Robustness of Quantitative Susceptibility Maps

Atkins, C.; Wu, T.; Bujak, B.; Inati, S.; Kellman, P.; Nair, G.

2026-06-05 radiology and imaging 10.64898/2026.05.28.26354148 medRxiv

Top 3%

0.5%

Most high-field MRI scanners conduct imaging using phased-array coils, in which the signals received by an array of coil elements are combined for downstream processing. Optimally combining these signals requires knowledge of each coil's spatial sensitivity profile, which can be acquired from a volume coil with homogeneous sensitivity across the field-of-view. However, this approach is not often used on high-field MRI scanners, especially on non-clinical systems; therefore, this work uses an algorithm based on the singular-value decomposition (SVD), called SVD-B1, to estimate coil sensitivities directly from the array data itself. Images produced by SVD-B1 are devoid of wormhole artifacts and open-ended fringe lines commonly seen in more conventional reconstructions. Quantitative Susceptibility Maps (QSMs) produced using the algorithm were compared to those produced using other combination algorithms across clinically relevant regions of in-vivo and postmortem human brains. As progressive levels of simulated noise were added to the data, SVD-B1's QSMs were up to 3% (in-vivo) and 13% (postmortem) more consistent (as measured by their Intraclass Correlation Coefficient) than those from other algorithms. Additionally, these QSMs were up to 8.5% (in-vivo) and 36% (postmortem) more accurate than other QSMs with respect to a "single-coil" reference. A parallel imaging extension of SVD-B1, called SVD-B1 GRAPPA, achieved similar results for QSMs generated from progressively more accelerated acquisition data. These results show that SVD-B1 can improve the sensitivity of high-resolution QSM to subtle changes in fine-grained tissue structures (e.g., in neurodegenerative disease) and help reduce scan times in clinical settings where shorter scans are imperative.

15

Early assessment of potential airline-mediated importation risk during the 2026 DRC-Uganda Bundibugyo virus disease outbreak

Kinoshita, R.; Suzuki, M.; Yoneoka, D.

2026-06-09 public and global health 10.64898/2026.06.01.26354569 medRxiv

Top 3%

0.5%

During the 2026 Bundibugyo virus disease outbreak in the Democratic Republic of the Congo and Uganda, we projected potential airline-mediated importation risk using a contemporary airline network and an externally calibrated Ebola importation hazard. Effective-distance analyses identified major international hub countries, including Belgium, France, South Africa, Kenya, and the United Arab Emirates, as higher-probability gateways within 30 days. These early projections provide a reproducible framework for real-time international situational awareness, while emphasizing that importation risk does not imply local transmission risk.

16

A Data-Driven Framework for Generating Population-Linked Case Vignettes from Nationwide Triage Data

Seidel, A.; Steiger, E.; Schuster, J.; Kroll, L. E.

2026-06-10 health informatics 10.64898/2026.06.08.26354886 medRxiv

Top 4%

0.4%

Background: Digital decision-support tools such as triage systems and symptom checkers support millions of health-related decisions each year. Their quality and safety are commonly evaluated using textual patient cases, known as case vignettes. However, existing vignette sets written by medical experts cover only a limited spectrum of real-world patient presentations and lack population weights, which would allow extrapolating evaluation results to the underlying patient population. Objective: This study aims to develop a data-driven framework for automatically generating a human-manageable set of case vignettes from nationwide triage data that captures broad presentation diversity and links each vignette to a quantitative weight reflecting the number of underlying patient assessments. Methods: From 3.2 million triage assessments conducted over one year using structured triage software in the German medical on-call service (telephone triage and online self-triage) and at the joint contact points of the outpatient emergency care service and hospital emergency departments, we randomly sampled 50,000 cases. Triage questionnaires were converted into semantic embeddings using a German Sentence Transformer Model and grouped by agglomerative clustering. For clusters containing sufficient assessments, we generated one representative assessment using a two-phase simulated-annealing optimization. The optimization minimized the distance to the cluster centroid while maximizing the number of answered triage questions, aiming for high representativeness and information content. Each representative assessment was assigned the size of its source cluster as its sample-based weight. A similarity-based sensitivity analysis was performed to examine whether these weights were preserved in the full 1-year population. 
Finally, the question-answer pairs of the representative assessments were converted into structured textual case vignettes using controlled prompting of a large language model. Results: The cluster analysis yielded 514 included clusters covering 96.8% of the sampled 50,000 assessments. The generated representatives showed strong agreement with the majority treatment-urgency recommendation of their source cluster (Spearman's ρ=0.78, p<0.001) and contained on average 4.3 more answered triage questions than the original assessments within their clusters. When weighted by cluster size, the representatives approximated the sample distributions of treatment urgency, demographics, and symptoms, although some systematic deviations remained, most notably an overrepresentation of female cases (+13.5%), patients aged 14-49 years (+8.0%), and the urgency category "As soon as possible" (+6.6%). Of 121 recorded symptoms, 101 (83.5%) were covered by the representatives; the rest each occurred in <0.5% of the sample. In a sensitivity analysis, cluster-based vignette weights were strongly correlated with similarity-based population weights (Spearman's ρ=0.77, p<0.001), and 90.1% of assessments in the full 1-year population were matched to at least one vignette. Conclusions: We present a data-driven framework for deriving a manageable set of population-weighted case vignettes from nationwide triage data. The resulting vignettes captured broad presentation diversity, approximated key sample characteristics, and provided an explicit quantitative link to the number of underlying patient assessments. After medical expert review and refinement, the vignettes may support more population-aware evaluation and quality assurance of digital decision-support tools.
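The weighting step described in the Methods can be sketched as follows, assuming cluster labels are already available from the clustering step. `vignette_weights` is an illustrative name, and `min_size` is a hypothetical threshold standing in for "sufficient assessments", which the abstract does not quantify.

```python
from collections import Counter


def vignette_weights(labels, min_size):
    """Map each included cluster to its size and report sample coverage."""
    sizes = Counter(labels)
    # Only clusters with enough assessments receive a representative.
    weights = {c: n for c, n in sizes.items() if n >= min_size}
    # Share of sampled assessments that fall in an included cluster.
    coverage = sum(weights.values()) / len(labels)
    return weights, coverage


# Toy labels for six assessments (illustrative only).
labels = [0, 0, 0, 1, 1, 2]
weights, coverage = vignette_weights(labels, min_size=2)
```

In the study's terms, `coverage` corresponds to the share of sampled assessments falling in included clusters (96.8% in the reported results), and each entry of `weights` is the sample-based weight of one vignette.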

17

Comparative Evaluation of Mosquito Repellent Products in South Asia and North America: Efficacy, Safety, and Public Health Implications

Sahal, K.; Amin, S. M. A.; Mostafa, T.; Wang, S.; Colucci, B.; Shafoyat, M. U.; Yuan, Z. -m.; Cheng, G.

2026-06-08 toxicology 10.64898/2026.06.07.26355094 medRxiv

Top 4%

0.4%

Mosquito-borne diseases continue to pose significant public health challenges worldwide, particularly in densely populated regions of South Asia and parts of North America experiencing increasing vector prevalence due to climate and environmental changes. Commercial mosquito repellents are widely used as a primary preventive measure; however, their efficacy, safety, and public health impacts vary depending on formulation, active ingredients, environmental conditions, and user practices. This study presents a comparative evaluation of commonly used mosquito repellent products in South Asia and North America, including coils, vaporizers, sprays, creams, and natural repellents. The research aims to assess repellent efficacy against major mosquito vectors, evaluate potential health and respiratory effects associated with prolonged exposure, and analyze consumer awareness and usage patterns across different regions. Laboratory-based efficacy testing and field observations were conducted to compare protection duration, repellency rate, and environmental performance under varying climatic conditions. Safety assessments included analysis of chemical composition, indoor air quality impact, and reported adverse health symptoms among users. The findings indicate significant differences in effectiveness and safety profiles among product categories and geographical regions. Synthetic repellents generally demonstrated longer repellency duration, while herbal formulations showed better safety and environmental compatibility. The study highlights the importance of standardized evaluation protocols, regulatory oversight, and public awareness in promoting safe and effective mosquito control strategies. These findings may support policymakers, healthcare professionals, and manufacturers in improving mosquito repellent technologies and reducing the burden of mosquito-borne diseases globally.

18

A Hierarchical Visual EEG Framework for the Assessment of Disorders of Consciousness

Chen, Y.; Ge, Q.; Li, H.; Kang, X.; Chen, Q.; He, W.; Sun, Y.; Zhang, S.; Laureys, S.; Chen, X.; He, J.; Gao, X.

2026-06-05 neurology 10.64898/2026.06.04.26354678 medRxiv

Top 5%

0.3%

The objective assessment of patients with disorders of consciousness (DOC) remains a significant clinical challenge. Behavioral scales like the Coma Recovery Scale-Revised (CRS-R) are susceptible to rater subjectivity and have difficulty detecting patients with cognitive-motor dissociation (CMD), while existing electrophysiological paradigms typically evaluate isolated processing levels, especially in visual functions. To address these limitations, we developed a novel, hierarchical visual EEG framework that evaluates three progressive tiers of visual processing--sensory input, selective attention, and object discrimination--within a single, unified paradigm. This framework uses steady-state and event-related potentials, analyzed with statistical testing and machine learning, to provide objective detection. In a cohort of 85 participants, the framework demonstrated robust alignment with behavioral CRS-R levels and successfully identified CMD patients missed by bedside behavioral examinations. Notably, model predictions derived from this framework showed a significant correlation with 3-month clinical outcomes. This prognostic utility generalized effectively and remained consistent across distinct EEG acquisition systems in an independent validation cohort of 17 patients. In summary, this work offers electrophysiological validation for the hierarchical design of the CRS-R and provides a practical tool for bedside objective assessment of DOC.

19

Multimodal approach to identify neuropsychophysiological subgroups in myalgic encephalomyelitis/chronic fatigue syndrome and their relevance for rehabilitation: protocol for a mechanistic cross-sectional and longitudinal study

Dooms, Y.; Qiu, L.; Coppieters, I.; Vergaelen, E.; Claes, S.; Dupont, P.; Hehl, M.; Cuypers, K.; Engler, H.; Dombrowski, K.; Verbeke, K.; Van den Bergh, O.; Raes, J.; Van Oudenhove, L.; Van Den Houte, M.; Bogaerts, K.

2026-06-08 neurology 10.64898/2026.06.05.26354983 medRxiv

Top 5%

0.3%

Introduction: Myalgic Encephalomyelitis (ME)/Chronic Fatigue Syndrome (CFS) is a debilitating condition characterised by severe fatigue and post-exertional malaise (PEM). Reported neuropsychophysiological abnormalities suggest ME/CFS is multifactorial, but current knowledge remains fragmented. This study protocol outlines a multimodal investigation designed to (1) compare neuropsychophysiological mechanisms between ME/CFS patients and healthy participants, (2) test an integrative model of ME/CFS, (3) identify neuropsychophysiological subgroups within the patient population, and (4) identify predictors of symptom response during rehabilitation. Methods and analysis: This study will enroll 115 ME/CFS patients and 55 healthy participants. Groups will be comparable in age, sex, and education level, with a larger patient sample enabling subgroup and longitudinal analyses. A cross-sectional assessment at baseline will be carried out in both groups. Patients will then be evaluated longitudinally throughout a standardized cognitive-behavioral therapy rehabilitation program delivered as routine care. Baseline measures include systemic inflammation and general health biomarkers, measures of autonomic and central nervous system function, neuroinflammation (magnetic resonance spectroscopy, [18F]DPA714 PET in a subsample), serum short-chain fatty acid levels, gut microbiota composition and function, and neuroendocrine and self-reported responses to psychosocial stress. Fatigue severity (physical and cognitive) and PEM will be assessed through validated questionnaires, ecological momentary assessment, and laboratory tasks. These will be re-evaluated during therapy, and all non-neuroimaging measures will be repeated after the rehabilitation program. 
Statistical analyses will comprise multivariate analysis of variance, general linear models, classification algorithms, structural equation models, least absolute shrinkage and selection operator principal component regression (LASSO-PCR), cluster analysis, and latent class growth analysis (LCGA).

20

Tune In or Take the Stage? A Randomized Controlled Trial Comparing After-School Music and Theatre Training with Neuroimaging Outcomes for Youth

Jamey, K.; Herschel, E.; Noel, C.; Villanueva, J.; Reyes, M.; Hsu, E.; Ilari, B.; Mack, W.; Luo, S.; Habibi, A.

2026-06-05 public and global health 10.64898/2026.06.03.26354844 medRxiv

Top 5%

0.3%

Introduction: While growing evidence suggests that music training supports child development, few long-term randomized controlled trials (RCTs) have rigorously tested these claims. Moreover, it remains unclear whether the benefits are confined to music-specific domains or extend to higher-order cognitive functions such as inhibitory control (IC), a core executive function associated with long-term outcomes in academic achievement, career success, socio-emotional health, and physical well-being. This paper presents the protocol for the Extracurricular Activity and Child Early Learning and Development (EXCEL) trial, an RCT designed to assess the feasibility of a long-term music training program focusing on the brain and behavioral correlates of IC. Methods: A total of 126 children, aged 6 to 8 years and residing in neighborhoods with limited resources in Los Angeles, were individually randomized to either a music (intervention) or theatre (active control) after-school program. Both programs were delivered over 24 months by established community arts organizations. Eligibility criteria included: average intellectual functioning, no major medical or psychiatric conditions, and MRI eligibility. Children with prior formal music training exceeding six months or severe hearing impairment were excluded. Before the intervention began, all participants completed baseline behavioral and neuroimaging assessments. The primary trial aim was to assess the effects of extended music training, relative to theatre training, on changes in measures of IC (i.e., Go/No-Go task and delayed gratification) and related neural functional activation. A secondary interim aim of the trial was to evaluate the feasibility of conducting a long-term RCT of music education in a first cohort, measured by participant retention, adherence to the program, willingness to continue at the 12-month mark, and fidelity. 
Progress: Recruitment, screening, baseline testing, randomization, and program enrollment began in August 2022, and after-school programming began in October 2022. For the first cohort (N = 42), the randomized interventions have been completed and all data have been collected. Intervention and active control programs for a second cohort are ongoing and will end in Fall 2026. Discussion: This paper reports the EXCEL trial protocol and provides feasibility estimates for implementing a long-term randomized controlled trial of music training in real-world, community-based settings with children. While similar neuroimaging RCTs are currently underway in Europe, the EXCEL trial is among the first in the United States to integrate longitudinal neuroimaging with arts intervention. Findings will inform the viability of scaling such programs and contribute to our understanding of how sustained music engagement may influence the development of inhibitory control circuitry in childhood.